Zhouhong Gu

Deep Research as Rubric for Reinforcement Learning

May 31, 2026

Scaling Behavior of Single LLM-Driven Multi-Agent Systems

May 30, 2026

ScholarGym: Benchmarking Deep Research Workflows on Academic Literature Retrieval

Jan 29, 2026

MARO: Learning Stronger Reasoning from Social Interaction

Jan 18, 2026

AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need

Jun 18, 2025

CompBench: Benchmarking Complex Instruction-guided Image Editing

May 18, 2025

LITE: LLM-Impelled efficient Taxonomy Evaluation

Apr 02, 2025

RECKON: Large-scale Reference-based Efficient Knowledge Evaluation for Large Language Model

Apr 01, 2025

ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection

Apr 01, 2025

GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization

Mar 26, 2025